
34 research outputs found

Workshop Proceedings of the 12th edition of the KONVENS conference

Author: Faaß Gertrud
Ruppenhofer Josef
Publication venue: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
Publication date: 11/07/2023

Publikationsserver des Instituts für Deutsche Sprache

Devices for Information Presentation in Electronic Dictionaries*

Author: Bothma T
Faaß Gertrud
Heid U
Prinsloo DJ
Publication venue: 'African Journals Online (AJOL)'
Publication date: 23/01/2013

Electronic dictionaries should support dictionary users by giving them guidance in text production and text reception, alongside a user-definable offer of lexicographic data for cognitive purposes. In this article, we sketch the principles of an interactive and dynamic electronic dictionary aimed at text production and text reception, guiding users in innovative ways, especially with respect to difficult, complicated or confusing issues. The lexicographer has to analyse the nature of the possible problems very carefully in order to suggest an optimal solution for a specific problem. We are of the opinion that there are numerous complex situations where users need more detailed support than is currently available in e-dictionaries to make valid and correct choices. For highly complex situations, we suggest guidance through a decision tree-like device. We assume that the solutions proposed here are not specific to one language only but can, after careful analysis, be applied to e-dictionaries in different languages across the world.
Keywords: Electronic Dictionaries; User Guidance; Text Production; Text Reception; Dictionary Design; Decision Tree Structure; Copulatives; Kinship Terminology; Information Presentation Device

AJOL - African Journals Online

HiER 2015 - Proceedings des 9. Hildesheimer Evaluierungs- und Retrievalworkshop

Author: Elbeshausen Stefanie
Faaß Gertrud
Griesbaum Joachim
Heuwing Ben
Jürgens Julia
Publication venue: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
Publication date: 18/07/2023

This volume collects the talks given at the 9th Hildesheim Evaluation and Retrieval Workshop (HIER), held on 9 and 10 July 2015 at the University of Hildesheim. The HIER workshop series began in 2001 with the aim of presenting and discussing the research results of information science at Hildesheim. Cooperation partners from other institutions now regularly take part, which we very much welcome. HIER also provides a forum for system presentations and practice-oriented contributions.

Publikationsserver des Instituts für Deutsche Sprache

IGGSA Shared Tasks on German Sentiment Analysis (GESTALT)

Author: Faaß Gertrud
Klinger Roman
Ruppenhofer Josef
Sonntag Jonathan
Struß Julia Maria
Wiegand Michael
Publication venue: Universität Heidelberg
Publication date: 01/01/2014

Ruppenhofer J, Klinger R, Struß JM, Sonntag J, Wiegand M. IGGSA Shared Tasks on German Sentiment Analysis (GESTALT). In: Faaß G, Ruppenhofer J, eds. Workshop Proceedings of the 12th Edition of the KONVENS Conference. Hildesheim, Germany: Universität Heidelberg; 2014: 164-173

Publications at Bielefeld University

From <tiger2/> to ISOTiger – Community Driven Developments for Syntax Annotation in SynAF

Author: Bosch Sonja
Eckart Kerstin
Faaß Gertrud
Heid Ulrich
Lee Kiyong
Pareja-Lora Antonio
Pretorius Laurette
Romary Laurent
Witt Andreas
Zeldes Amir
Zipser Florian
Publication venue: HAL CCSD
Publication date: 12/12/2014

In 2010, ISO published a standard for syntactic annotation, ISO 24615:2010 (SynAF). Back then, the document specified a comprehensive reference model for the representation of syntactic annotations, but no accompanying XML serialisation. ISO's subcommittee on language resource management (ISO TC 37/SC 4) is working on making the SynAF serialisation ISOTiger an additional part of the standard. This contribution addresses the current state of development of ISOTiger, along with a number of open issues on which we are seeking community feedback in order to ensure that ISOTiger becomes a useful extension to the SynAF reference model.

INRIA a CCSD electronic archive server

Eine korpuslinguistische Untersuchung der Sepedi-Negation für die Lexikographie

Author: Faaß Gertrud
Publication venue: 'Stellenbosch University'
Publication date: 02/05/2023

So far, Sepedi negations have been considered mainly from the point of view of lexicographical treatment. Theoretical works on Sepedi have been used for this purpose, with the objective of a neat description of these negations in a (paper) dictionary. This paper takes a different perspective: instead of theoretical works, corpus linguistic methods are used: (1) a Sepedi corpus is examined on the basis of existing descriptions of the occurrences of a relevant verb, looking at its negated forms from a purely prescriptive point of view; (2) a "corpus-driven" strategy is employed, looking only for sequences of negation particles (or morphemes) in order to list occurring constructions, without taking into account the verbs occurring in them, apart from their endings. The approach in (2) is only intended to show a possible methodology to extend existing theories on occurring negations. We would also like to help lexicographers establish a frequency-based order of entries of possible negation forms in their dictionaries by showing them the number of respective occurrences. As with all corpus linguistic work, however, we must regard corpus evidence not as representative, but as tendencies of language use that can be detected and described. This is especially true for Sepedi, for which only a few small corpora exist. This paper also describes the resources and tools used to create the necessary corpus and how it was annotated with parts of speech and lemmas. A side result is an examination of the quality of available Sepedi part-of-speech taggers with respect to verbs, negation morphemes and subject concords.

Publikationsserver des Instituts für Deutsche Sprache

The verbal phrase of Northern Sotho: A morpho-syntactic perspective

Author: Faaß Gertrud
Publication venue: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
Publication date: 10/05/2023

So far, comprehensive grammar descriptions of Northern Sotho have only been available in the form of prescriptive books aiming at teaching the language. This paper describes parts of the first morpho-syntactic description of Northern Sotho from a computational perspective (Faaß, 2010a). Such a description is necessary for implementing rule-based, operational grammars. It is also essential for the annotation of training data to be utilised by statistical parsers. The work that we partially present here may hence provide a resource for computational processing of the language in order to proceed with producing linguistic representations beyond tagging, be it chunking or parsing. The paper begins by describing significant Northern Sotho verbal morpho-syntactics (section 2). It is shown that the topology of the verb can be depicted as a slot system which may form the basis for computational processing (section 3). Note that the implementation of the described rules (section 4) and coverage tests are ongoing processes, on which we will report in more detail at a later stage.

Publikationsserver des Instituts für Deutsche Sprache

Special issue on challenges in computational linguistics, empiric research & multidisciplinary potential of German song lyrics

Author: Faaß Gertrud
Schneider Roman
Publication venue: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
Publication date: 05/06/2023

Publikationsserver des Instituts für Deutsche Sprache

A computational implementation of the Northern Sotho infinitive

Author: Faaß Gertrud
Prinsloo DJ
Publication venue: NISC Pty Ltd
Publication date: 07/04/2012

The aim of this article is to describe the infinitive in Northern Sotho based on corpus data and the respective literature; so far, all share the same view: The infinitive is a noun (of class 15) and a verb at the same time – ‘it manifests both nominal as well as verbal features’ (Poulos & Louwrens, 1994:42). When implementing these constellations in a parser, however, a new perspective is found: to achieve its successful implementation, the infinitive must be defined as a verb on the one hand and as a noun of class 15 on the other, derived from this verb through nominalization (transposition). Instead of a subject concord, the verb stem in the infinitive is preceded by the respective class prefix. S.Afr.J.Afr.Lang., 31(2) 201

AJOL - African Journals Online

Proceedings of the 12th edition of the KONVENS conference

Author: Faaß Gertrud
Ruppenhofer Josef
Publication venue: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
Publication date: 20/07/2023

The 2014 issue of KONVENS is even more a forum for exchange: its main topic is the interaction between Computational Linguistics and Information Science, and the synergies such interaction, cooperation and integrated views can produce. This topic, at the crossroads of different research traditions which deal with natural language as a container of knowledge and with methods to extract and manage knowledge that is linguistically represented, is close to the heart of many researchers at the Institut für Informationswissenschaft und Sprachtechnologie of Universität Hildesheim: it has long been one of the institute’s research topics, and it has received even more attention over the last few years. The main conference papers deal with this topic from different points of view, involving flat as well as deep representations, automatic methods targeting annotation and hybrid symbolic and statistical processing, as well as new Machine Learning-based approaches, but also the creation of language resources for both machines and humans, and methods for testing the latter to optimize their human-machine interaction properties. In line with the general topic, KONVENS-2014 focuses on areas of research which involve this cooperation of information science and computational linguistics: for example learning-based approaches, (cross-lingual) Information Retrieval, Sentiment Analysis, paraphrasing or dictionary and corpus creation, management and usability.

Publikationsserver des Instituts für Deutsche Sprache